Cognitively Inspired Real-Time Vision Core
نویسندگان
چکیده
We introduce a cognitively inspired novel binary image representation and utilize it for real-time operating computer vision core, which is capable of simultaneously detecting a specific object in an image, classifying an image region provided by an algorithm such as motion detection, and tracking multiple objects in a video. In this framework, hidden layer representations of binary receptive field neural networks are utilized to generate compact image representations for various classification based functionalities. Object detection is implemented by learning a classifier on hidden layer activities and performing sliding window based search on the image. Morever, a classification based object tracking algorithm is introduced that uses the proposed framework, whose tracking performance in standard datasets is shown to be comparable to the state of the art Email addresses: [email protected] (Ozgur Yilmaz), [email protected] (Ismail Ozsarac), [email protected] (Omer Gunay), [email protected] (Huseyin Ozkan) 1Corresponding author Preprint submitted to Journal of LTEX Templates May 7, 2015 techniques. We further propose several additional functionalities on the same core, such as sparse interest point extraction, salient motion detection, scene recognition. These set of capabilities in the arsenal, artificial vision is expected to perform the necessary fundamental operations in real time, paving a way for more complex inferences, such as geometric computations and cross-modal information fusion.
منابع مشابه
Self-calibrating Marker Tracking in 3D with Event-Based Vision Sensors
Following an object’s position relative to oneself is a fundamental functionality required in intelligent real-world interacting robotic systems. This paper presents a computationally efficient vision based 3D tracking system, which can ultimately operate in real-time on autonomous mobile robots in cluttered environments. At the core of the system, two neural inspired eventbased dynamic vision ...
متن کاملStereoscopic depth estimation for online vision systems
The human visual perception heavily depends on stereoscopic vision. By fusing the two views that our eyes provide, a 3-D sensation of the surrounding is generated. It is therefore natural to assume that machine vision systems also benefit from a comparable sense. A lot of work has been done in the area of machine stereo vision, but a severe drawback of today’s algorithms is that they either ach...
متن کاملFast Optical Flow Using Dense Inverse Search
Most recent works in optical flow extraction focus on the accuracy and neglect the time complexity. However, in real-life visual applications, such as tracking, activity detection and recognition, the time complexity is critical. We propose a solution with very low time complexity and competitive accuracy for the computation of dense optical flow. It consists of three parts: 1) inverse search f...
متن کاملSaccadic Search with Gabor features appliedto Eye Detection and Real { Time
The Gabor decomposition is a ubiquitous tool in computer vision. Nevertheless, it is generally considered computationally demanding for active vision applications. We suggest an attention{driven approach to feature detection inspired by the human saccadic system. A dramatic speedup is achieved by computing the Gabor decomposition only on the points of a sparse retinotopic grid. An oo{line eye d...
متن کاملRepeat before Forgetting: Spaced Repetition for Efficient and Effective Training of Neural Networks
We present a novel approach for training artificial neural networks. Our approach is inspired by broad evidence in psychology that shows human learners can learn efficiently and effectively by increasing intervals of time between subsequent reviews of previously learned materials (spaced repetition). We investigate the analogy between training neural models and findings in psychology about huma...
متن کامل